
Reviews: Modern Neural Networks Generalize on Small Data Sets

Neural Information Processing Systems

This paper presents an interesting idea: deep neural networks are able to maintain reasonable generalization performance, even on relatively small datasets, because they can be viewed as an ensemble of uncorrelated sub-networks.

Quality: The decomposition method seems sound, except for the requirement that the model and the sub-nets achieve 100% training accuracy. While this will hold on some datasets (often high-dimensional ones), on others such an approach would fail badly. That seems to me a fundamental weakness, especially if there are datasets of that nature on which deep neural nets still perform reasonably well. Note also that a random forest is an unweighted combination of base classifiers, whereas the combination of the decomposed sub-networks is learned, with the weights tuned on the training data.


5 Ways to Apply AI to Small Data Sets - KDnuggets

#artificialintelligence

However, we only ever hear about using AI to understand big data sets. This is because small data sets are usually easy for people to understand, so applying AI to analyze and interpret them isn't necessary. These days, though, many businesses and manufacturers integrating AI into the production line run into data scarcity: unlike big companies, many operations cannot collect massive training sets due to risk, time, and budget limitations. Because most companies don't know how to benefit from applying AI to small data sets correctly, they blindly use it to make future predictions from past data.


3 Ways to Better Apply AI to Small Data Sets

#artificialintelligence

Sample size always plays a role in data science, but there are certain instances where risk, time, or expense will limit the size of your data: you can only launch a rocket once; you only have so much time to test a much-needed vaccine; your early-stage startup or B2B company only has a handful of customer data points to work with. And in these small data situations, I've found that companies either avoid data science altogether or use it incorrectly. One of the more common issues in applying AI is blindly relying on historical data to predict future situations; I call this "assuming the past is the future." A common example is assuming that a model that has worked well for us in previous markets will work the same "magic" when we use it to launch products in a new market. The problem is that the new market, the future, is completely different from the past one, which leaves us with poor judgment, incorrect predictions, and lackluster business results.


Modern Neural Networks Generalize on Small Data Sets

Olson, Matthew, Wyner, Abraham, Berk, Richard

Neural Information Processing Systems

In this paper, we use a linear program to empirically decompose fitted neural networks into ensembles of low-bias sub-networks. We show that these sub-networks are relatively uncorrelated, which gives rise to an internal regularization process, much like that of a random forest, and helps explain why a neural network is surprisingly resistant to overfitting. We then demonstrate this in practice by applying large neural networks, with hundreds of parameters per training observation, to a collection of 116 real-world data sets from the UCI Machine Learning Repository. These data sets contain far fewer training examples than the image classification tasks generally studied in the deep learning literature, as well as non-trivial label noise. We show that even in this setting, deep neural nets are capable of achieving superior classification accuracy without overfitting.
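The paper finds its decomposition with a linear program, but the basic identity it rests on is easy to see in a toy case: for a one-hidden-layer network with a linear output layer, any partition of the hidden units splits the network exactly into a sum of sub-network outputs. The numpy sketch below illustrates only that identity, not the paper's LP-based method; the network, its weights, and the equal-sized partition are all illustrative assumptions.

```python
import numpy as np

rng = np.random.default_rng(0)

# Toy one-hidden-layer network: x -> ReLU(W1 @ x) -> w2 . h
d, H = 5, 16
W1 = rng.normal(size=(H, d))  # hidden-layer weights
w2 = rng.normal(size=H)       # linear output weights

def forward(x, units):
    """Network output using only the hidden units indexed by `units`."""
    h = np.maximum(W1[units] @ x, 0.0)  # ReLU activations of chosen units
    return w2[units] @ h

x = rng.normal(size=d)

# Partition the hidden units into 4 disjoint sub-networks.
groups = np.array_split(np.arange(H), 4)
sub_outputs = [forward(x, g) for g in groups]

# Because ReLU acts per-unit and the output layer is linear, the full
# network's output is exactly the sum of the sub-network outputs.
full = forward(x, np.arange(H))
print(np.isclose(full, sum(sub_outputs)))  # True
```

The paper's linear program goes further by choosing a decomposition whose sub-networks are individually low-bias and mutually decorrelated; the exact additivity shown here is what makes such an ensemble view of a single network possible at all.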